What Your Browser Shows You    Decimal Code
語    35486
솋    49547
あ    12354
Ж    01046
א    01488
Σ    00931
ฒ    03602
ش    01588
Գ    01331
ओ    02323
ຢ    03746
གྷ    03907
შ    04328
35486 - Chinese character for "language" (also used in Japanese, as in 日本語).
49547 - Korean (Hangul) syllable syeh.
12354 - Hiragana letter a.
01046 - Cyrillic (Russian) capital letter zhe.
01488 - Hebrew aleph.
00931 - Greek sigma.
03602 - Thai letter tho phuthao.
01588 - Arabic letter sheen.
01331 - Armenian capital letter gim.
02323 - Devanagari letter o.
03746 - Lao letter yo.
03907 - Tibetan letter gha.
04328 - Georgian letter shin.
What Your Browser Shows You    Decimal Code
Ѧ    01126
պ    01402
੫    02667
ಢ    03234
ณ    03603
ტ    04322
ቐ    04688
Ᏻ    05107
ᙤ    05732
蠧    34855
01126 - Cyrillic capital letter little yus.
01402 - Armenian small peh.
02667 - Gurmukhi digit five.
03234 - Kannada ddha.
03603 - Thai no nen.
04322 - Georgian letter tar.
04688 - Ethiopic qha.
05107 - Cherokee yu.
05732 - UCAS chee (Carrier).
34855 - Chinese character for "moth".
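
The five-digit numbers in the lists above are decimal Unicode code points. A short Python sketch can check each one against the character it is supposed to produce. It assumes the page writes each character as a zero-padded HTML numeric character reference such as `&#01046;`, which is a guess from the way the codes are printed. The `CODES` list and the `describe` helper are hypothetical names introduced only for this illustration.

```python
import html
import unicodedata

# Decimal code points copied from the two lists above.
CODES = [
    35486, 49547, 12354, 1046, 1488, 931, 3602, 1588, 1331, 2323,
    3746, 3907, 4328, 1126, 1402, 2667, 3234, 3603, 4322, 4688,
    5107, 5732, 34855,
]


def describe(code):
    """Return (character, numeric reference, Unicode name) for a code point."""
    ch = chr(code)
    # Assumed form of the reference: zero-padded to five digits, as on this page.
    ref = f"&#{code:05d};"
    name = unicodedata.name(ch, "<unnamed>")
    return ch, ref, name


for code in CODES:
    ch, ref, name = describe(code)
    # Decoding the reference should give back the same character.
    assert html.unescape(ref) == ch
    print(f"{code:05d}  {ch}  {name}")
```

The names printed come from the Unicode data bundled with the Python interpreter, so they can be compared directly with the descriptions in the lists.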